Indicators Ratings Parallel =  11 ms
Indicators Ratings =  10 ms
CPU compute ratings: 1 iteration avg time: 2146 ms
 
 ---- 
cuda_malloc<d_ratings> 0.83
cuda_malloc<d_companies_map> 0.71
kernel<computeRatingsKernel> 1.18
kernelSegment<sort> 3.68
toHostMemory<d_ratings> 1.17
toHostMemory<d_companies_map> 1.13
ranking 2.14
TotalRankingTime 11.22

cuda_malloc<d_ratings> 0.27
cuda_malloc<d_companies_map> 0.08
kernel<computeRatingsKernel> 1.15
kernelSegment<sort> 1.13
toHostMemory<d_ratings> 1.41
toHostMemory<d_companies_map> 1.11
ranking 2.17
TotalRankingTime 7.64

cuda_malloc<d_ratings> 0.25
cuda_malloc<d_companies_map> 0.09
kernel<computeRatingsKernel> 1.15
kernelSegment<sort> 1.09
toHostMemory<d_ratings> 2.08
toHostMemory<d_companies_map> 1.92
ranking 2.92
TotalRankingTime 9.75

GRID K520 :  797.000 Mhz   (Ordinal 0)
8 SMs enabled. Compute Capability sm_30
FreeMem:   3908MB   TotalMem:   4096MB   64-bit pointers.
Mem Clock: 2500.000 Mhz x 256 bits   (160.0 GB/s)
ECC Disabled


